Goto

Collaborating Authors

 product-quantized code


PQk-means: Billion-scale Clustering for Product-quantized Codes

#artificialintelligence

PQk-means is a Python library for efficient clustering of large-scale data. While k-means clustering is slow and not much efficient to handle large scale data, PQk-means is an efficient clustering method for billion-scale feature vectors. In terms of PQk-means, it achieves its speed and efficiency by first compressing input vectors into short product-quantized (PQ) codes. If openmp is installed, it will be automatically used to parallelize the algorithm for faster calculation.